The Bavarian Archive for Speech Signals: Resources for the Speech Community
Authors
Abstract
This paper gives an overview of the activities at the Bavarian Archive for Speech Signals (BAS), which was founded as a non-profit organization in 1995. The main purpose of BAS is the development of a Complete Phonetic Theory (CPT) of German based on the empirical exploitation of very large databases of spoken German. On the way to that goal, BAS will act as a focal point for all computer-readable speech resources in the German language and distribute these resources to the speech community. These resources are intended to cover the speech part of the German language, i.e. speech data, labeling and segmentations, and knowledge about pronunciation. In the following we give a concise overview of which resources are presently available at BAS, how they were produced, how they can be obtained from BAS, and how we use these resources in various scientific activities, followed by a brief summary of ongoing projects.
Similar resources
Bavarian Archive for Speech Signals (BAS) Status Report 1995-2000
The Bavarian Archive for Speech Signals (BAS) is a joint initiative of the Bavarian State and the Ludwig Maximilians Universität München. It is located at the host organisation Institut für Phonetik und Sprachliche Kommunikation and collects, evaluates, produces and disseminates speech-based resources to the scientific community. Our focus is the German language covering ...
Three New Corpora at the Bavarian Archive for Speech Signals - and a First Step Towards Distributed Web-Based Recording
The Bavarian Archive for Speech Signals has released three new speech corpora for both industrial and academic use: a) Hempels Sofa contains recordings of up to 60 seconds of non-scripted telephone speech, b) ZipTel is a corpus with telephone speech covering postal addresses and telephone numbers from a real world application, and c) RVG-J, an extension of the original Regional Variants of Germ...
Speech and Speech Related Resources at BAS
The Bavarian Archive for Speech Signals (BAS), located at the Ludwig Maximilians Universität München, Germany, collects, evaluates, produces and disseminates German speech resources to the scientific community. Our focus is the German language, covering a large geographical part of central Europe. Speech and speech-related resources are usually produced for certain tasks or projects. Therefore it is n...
The SmartKom Multimodal Corpus at BAS
In this contribution we announce and describe in detail the new multimodal corpus evolving from the publicly funded German SmartKom project. The first release of the corpus (BAS SK-P 1.0) was finished at the end of 2001 and will be ready for distribution to the scientific community in July 2002. The SmartKom corpus will be the first of a new generation of Language Resources (LR) designed for a mo...
Alcohol Language Corpus: The First Public Corpus of Alcoholized German Speech
The Alcohol Language Corpus (ALC) is the first publicly available speech corpus comprising intoxicated and sober speech of 162 female and male German speakers. Recordings are done in the automotive environment to allow for the development of automatic alcohol detection and to ensure a consistent acoustic environment for the alcoholized and the sober recording. The recorded speech covers a varie...